DOUBLE-STRANDED PEPTIDE NUCLEIC ACIDS

RELATED APPLICATION 

This patent application is related to the patent application entitled Higher Order Structure And Binding Of Peptide Nucleic Acids, filed herewith bearing attorney docket number ISIS-1052. This patent application also is a continuation-in-part of patent application Serial No. 08/054,363, filed April 26, 1993, which is a continuation-in-part of application PCT EP92/01219, filed May 19, 1992 and published November 26, 1992 as WO 92/20702. The entire contents of each of the foregoing patent applications are incorporated herein by reference.
FIELD OF THE INVENTION 

This invention is directed to generally linear compounds or "strands" wherein naturally-occurring nucleobases or other nucleobase-binding moieties preferably are covalently bound to a polyamide backbone. In particular, the invention concerns compounds wherein two such strands coordinate through hydrogen bonds to form a DNA-like double strand.

BACKGROUND OF THE INVENTION

The transcription and processing of genomic duplex DNA is controlled by generally proteinaceous transcription factors that recognize and bind to specific DNA sequences. One strategy for the control of gene expression is to add to a cell double-stranded DNA or double-stranded DNA-like structures that will bind to the desired factor in preference to or in competition with genomic DNA, thereby inhibiting processing of the DNA into a protein. This modulates the protein's action within the cell and can lead to beneficial effects on cellular function. Naturally occurring or unmodified oligonucleotides are impractical for such use because they have short in vivo half-lives and they are poor cell membrane penetrators.

These problems have resulted in an extensive search for improvements and alternatives. In order to improve half-life as well as membrane penetration, a large number of variations in polynucleotide backbones have been undertaken. These variations include the use of methylphosphonates, phosphorothioates, phosphorodithioates, phosphoramidates, phosphate esters, bridged phosphoroamidates, bridged phosphorothioates, bridged methylenephosphonates, dephospho internucleotide analogs with siloxane bridges, carbonate bridges, carboxymethyl ester bridges, acetamide bridges, carbamate bridges, thioether, sulfoxy, sulfono bridges, various "plastic" DNAs, α-anomeric bridges, and borane derivatives. The great majority of these backbone modifications lead to decreased stability for hybrids formed between the modified oligonucleotide and its complementary native oligonucleotide, as assayed by measuring T_m values.

Consequently, there remains a need in the art for stable compounds that can form double-stranded, helical structures mimicking double-stranded DNA.

OBJECTS OF THE INVENTION

It is one object of the present invention to provide compounds that mimic the double-helical structure of DNA.

It is a further object of the invention to provide compounds wherein linear, polymeric strands coordinate through hydrogen bonds to form double helices.

It is another object to provide compounds wherein naturally-occurring nucleobases or other nucleobase-binding moieties are covalently bound to a non-sugar-phosphate backbone.

It is yet another object to provide therapeutic, diagnostic, and prophylactic methods that employ such compounds.
SUMMARY OF THE INVENTION

The present invention provides a novel class of compounds, known as peptide nucleic acids (PNAs), that can coordinate with one another or with single-stranded DNA to form double-stranded (i.e., duplex) structures. The compounds include homopolymeric PNA strands and heteropolymeric PNA strands (e.g., DNA/PNA strands), which coordinate through hydrogen bonding to form helical structures. Duplex structures can be formed, for example, between two complementary PNA or PNA/DNA strands or between two complementary regions within a single such strand.

In certain embodiments, each strand of the double-stranded compounds of the invention includes a sequence of ligands covalently bound by linking moieties, at least one of said linking moieties comprising an amide, thioamide, sulfinamide or sulfonamide linkage. The ligands on one strand hydrogen bond with ligands on the other strand and, together, assume a double helical structure. The compounds of the invention preferably comprise ligands linked to a polyamide backbone. Representative ligands include either the four main naturally occurring DNA bases (i.e., thymine, cytosine, adenine or guanine) or other naturally occurring nucleobases (e.g., inosine, uracil, 5-methylcytosine or thiouracil) or artificial bases (e.g., bromothymine, azaadenines or azaguanines, 5-propynylthymine, etc.) attached to a peptide backbone through a suitable linker. These ligands are linked to the polyamide backbone through aza nitrogen atoms or through amido and/or ureido tethers.

In certain preferred embodiments, the peptide nucleic acids of the invention have the general formula (I):

[Structural formula (I)]

wherein:

n is at least 2,

each of L^1-L^n is independently selected from the group consisting of hydrogen, hydroxy, (C1-C4)alkanoyl, naturally occurring nucleobases, non-naturally occurring nucleobases, aromatic moieties, DNA intercalators, nucleobase-binding groups, heterocyclic moieties, and reporter ligands, at least one of L^1-L^n being a naturally occurring nucleobase, a non-naturally occurring nucleobase, a DNA intercalator, or a nucleobase-binding group;

each of C^1-C^n is (CR^6R^7)_y where R^6 is hydrogen and R^7 is selected from the group consisting of the side chains of naturally occurring alpha amino acids, or R^6 and R^7 are independently selected from the group consisting of hydrogen, (C2-C6)alkyl, aryl, aralkyl, heteroaryl, hydroxy, (C1-C6)alkoxy, (C1-C6)alkylthio, NR^3R^4 and SR^5, where R^3 and R^4 are as defined above, and R^5 is hydrogen, (C1-C6)alkyl, hydroxy-, alkoxy-, or alkylthio-substituted (C1-C6)alkyl, or R^6 and R^7 taken together complete an alicyclic or heterocyclic system;

each of D^1-D^n is (CR^6R^7)_z where R^6 and R^7 are as defined above;

each of y and z is zero or an integer from 1 to 10, the sum y + z being greater than 2 but not more than 10;

each of G^1-G^(n-1) is -NR^3CO-, -NR^3CS-, -NR^3SO- or -NR^3SO2-, in either orientation, where R^3 is as defined above;

each pair of A^1-A^n and B^1-B^n are selected such that:
(a) A is a group of formula (IIa), (IIb) or (IIc) and B is N or R^3N+; or
(b) A is a group of formula (IId) and B is CH;

[Structural formulae (IIa), (IIb), (IIc) and (IId)]

where:



X is O, S, Se, NR^3, CH2 or C(CH3)2;

Y is a single bond, O, S or NR^4;

each of p and q is zero or an integer from 1 to 5, the sum p+q being not more than 10;

each of r and s is zero or an integer from 1 to 5, the sum r+s being not more than 10;

each R^1 and R^2 is independently selected from the group consisting of hydrogen, (C1-C4)alkyl which may be hydroxy- or alkoxy- or alkylthio-substituted, hydroxy, alkoxy, alkylthio, amino and halogen;

each of G^1-G^(n-1) is -NR^3CO-, -NR^3CS-, -NR^3SO- or -NR^3SO2-, in either orientation, where R^3 is as defined above;

Q is -CO2H, -CONR'R'', -SO3H or -SO2NR'R'' or an activated derivative of -CO2H or -SO3H; and

I is -NHR'''R'''' or -NR'''C(O)R'''', where R', R'', R''' and R'''' are independently selected from the group consisting of hydrogen, alkyl, amino protecting groups, reporter ligands, intercalators, chelators, peptides, proteins, carbohydrates, lipids, steroids, nucleosides, nucleotides, nucleotide diphosphates, nucleotide triphosphates, oligonucleotides, oligonucleosides and soluble and non-soluble polymers.

In certain embodiments, at least one A is a group of formula (IIc) and B is N or R^3N+. In other embodiments, A is a group of formula (IIa) or (IIb), B is N or R^3N+, and at least one of y or z is not 1 or 2.

Preferred peptide nucleic acids have general formula (IIIa)-(IIIc):

[Structural formulae (IIIa), (IIIb) and (IIIc)]

wherein:

each L is independently selected from the group consisting of hydrogen, phenyl, heterocyclic moieties, naturally occurring nucleobases, and non-naturally occurring nucleobases;

each R^7' is independently selected from the group consisting of hydrogen and the side chains of naturally occurring alpha amino acids;

n is an integer from 1 to 60;

each of k, l, and m is independently zero or an integer from 1 to 5;

p is zero or 1;

R^h is OH, NH2 or -NHLysNH2; and

R^i is H or COCH3.

Particularly preferred are compounds having formula (IIIa)-(IIIc) wherein each L is independently selected from the group consisting of the nucleobases thymine (T), adenine (A), cytosine (C), guanine (G) and uracil (U), k and m are zero or 1, and n is an integer from 1 to 30, in particular from 4 to 20.

The peptide nucleic acids of the invention are synthesized by adaptation of standard peptide synthesis procedures, either in solution or on a solid phase. The synthons used are monomer amino acids or their activated derivatives, protected by standard protecting groups. The PNAs also can be synthesized by using the corresponding diacids and diamines.

Thus, the novel monomer synthons according to the invention are selected from the group consisting of amino acids, diacids and diamines having general formulae:

[Structural formulae (IV), (V) and (VI)]

wherein L, A, B, C and D are as defined above, except that any amino groups therein may be protected by amino protecting groups; E is COOH, CSOH, SOOH, SO2OH or an activated derivative thereof; and F is NHR^3 or NPgR^3, where R^3 is as defined above and Pg is an amino protecting group.

Preferred monomer synthons according to the invention have formula (VIIIa)-(VIIIc):

[Structural formulae (VIIIa), (VIIIb) and (VIIIc)]

or amino-protected and/or acid terminal activated derivatives thereof, wherein L is selected from the group consisting of hydrogen, phenyl, heterocyclic moieties, naturally occurring nucleobases, and non-naturally occurring nucleobases; and R^7' is selected from the group consisting of hydrogen and the side chains of naturally occurring alpha amino acids.

These compounds are able to recognize one another to produce double helices. Such recognition can span sequences 5-60 base pairs long. Sequences between 10 and 20 bases are of interest since this is the range within which unique DNA sequences of prokaryotes and eukaryotes are found. Sequences of 17-18 bases are of particular interest since this is the length of unique sequences in the human genome.
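
The length argument above can be illustrated with a rough count, offered only as a sketch. It assumes a random-sequence model (each base equally likely) and a haploid human genome of about 3 x 10^9 base pairs; neither assumption is stated in this specification, and the function name is hypothetical.

```python
def min_unique_length(search_space_nt: int) -> int:
    """Smallest n for which the 4**n possible n-mers are at least as
    numerous as the positions in the search space (random-sequence model)."""
    n = 1
    while 4 ** n < search_space_nt:
        n += 1
    return n


# Assumed haploid human genome size in base pairs (not from this document).
HUMAN_GENOME_BP = 3_000_000_000

# A duplex-forming strand can in principle match either genomic strand,
# so the search space is taken as twice the genome size.
human_n = min_unique_length(2 * HUMAN_GENOME_BP)
print("minimum expected unique length, human:", human_n)
```

Under these assumptions the estimate falls at the 17-base figure cited above; real genomes are not random, so the count is only a lower-bound heuristic.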

Thus, in one aspect, the present invention provides methods for modulating the activity of a transcription factor in a cell, comprising the steps of forming a PNA-containing double strand that binds the transcription factor and introducing the double strand into the cell.

Further, the invention provides methods for modulating the activity of a protein in a cell, comprising the steps of forming a PNA-containing double strand that binds to or suppresses expression of the protein and introducing the double strand into the cell.

The PNA duplex structures of the invention mimic dsDNA and can be used in diagnostics, therapeutics and as research reagents and kits. They can be used in pharmaceutical compositions by including a suitable pharmaceutically acceptable diluent or carrier.
BRIEF DESCRIPTION OF THE FIGURES 

The numerous objects and advantages of the present invention may be better understood by those skilled in the art by reference to the accompanying figures, in which:

Figure 1 is a plot showing titration to saturation of a 10-mer PNA to a complementary 10-mer PNA.

Figure 2 is the development in time of the circular dichroism signal of certain compounds of the invention.

Figure 3 is an Arrhenius plot of reaction rates at various temperatures during helical duplexation.
DETAILED DESCRIPTION OF THE INVENTION 

As will be recognized, a variety of double-stranded (i.e., duplex) PNA-containing structures can be prepared according to the present invention. Representative duplexes can be formed within a single homopolymeric PNA strand or a single heteropolymeric strand (e.g., a chimera PNA-DNA or PNA-RNA strand), or between two homopolymeric PNA strands, two heteropolymeric PNA strands, or a homopolymeric PNA strand and a heteropolymeric PNA strand.

Each PNA strand or PNA portion of a chimera strand preferably comprises a plurality of ligands, L, linked to a backbone via attachment at the position found in nature, i.e., position 9 for adenine or guanine, and position 1 for thymine or cytosine. Alternatively, L can be a non-naturally occurring nucleobase (nucleobase analog), another base-binding moiety, an aromatic moiety, (C1-C4)alkanoyl, hydroxy or even hydrogen. It will be understood that the term nucleobase includes nucleobases bearing removable protecting groups. Some typical nucleobase ligands and illustrative synthetic ligands are shown in Figure 2 of WO 92/20702. Furthermore, L can be a DNA intercalator, a reporter ligand such as, for example, a fluorophor, radio label, spin label, hapten, or a protein-recognizing ligand such as biotin. In monomer synthons, L can be blocked with protecting groups, as illustrated in Figure 4 of WO 92/20702.

Linker A can be a wide variety of groups such as -CR^1R^2CO-, -CR^1R^2CS-, -CR^1R^2CSe-, -CR^1R^2CNHR^3-, -CR^1R^2C=CH2- and -CR^1R^2C=C(CH3)2-, where R^1, R^2 and R^3 are as defined above. Preferably, A is methylenecarbonyl (-CH2CO-), amido (-CONR^3-), or ureido (-NR^3CONR^3-). Also, A can be a longer chain moiety such as propanoyl, butanoyl or pentanoyl, or corresponding derivative, wherein O is replaced by another value of X or the chain is substituted with R^1R^2 or is heterogenous, containing Y. Further, A can be a (C2-C6)alkylene chain, a (C2-C6)alkylene chain substituted with R^1R^2 or can be heterogenous, containing Y. In certain cases, A can just be a single bond.

In one preferred form of the invention, B is a nitrogen atom, thereby presenting the possibility of an achiral backbone. B can also be R^3N+, where R^3 is as defined above, or CH.

In the preferred form of the invention, C is -CR^6R^7-, but can also be a two carbon unit, i.e. -CHR^6CHR^7- or -CR^6R^7CH2-, where R^6 and R^7 are as defined above. R^6 and R^7 also can be a heteroaryl group such as, for example, pyrrolyl, furyl, thienyl, imidazolyl, pyridyl, pyrimidinyl, indolyl, or can be taken together to complete an alicyclic system such as, for example, 1,2-cyclobutanediyl, 1,2-cyclopentanediyl or 1,2-cyclohexanediyl.

In a preferred form of the invention, E in the monomer synthon is COOH or an activated derivative thereof, and G in the oligomer is -CONR^3-. As defined above, E also can be CSOH, SOOH, SO2OH or an activated derivative thereof, whereby G in the oligomer becomes -CSNR^3-, -SONR^3- and -SO2NR^3-, respectively. The activation can, for example, be achieved using an acid anhydride or an active ester derivative, wherein hydrogen in the groups represented by E is replaced by a leaving group suited for generating the growing backbone.

The amino acids which form the backbone can be identical or different. We have found that those based on 2-aminoethylglycine are especially well suited to the purpose of the invention.

In some cases it may be of interest to attach ligands at either terminus (Q, I) to modulate other properties of the PNAs. Representative ligands include DNA intercalators or basic groups, such as lysine or polylysine. Further groups such as carboxy and sulfo groups could also be used. The design of the synthons further allows such other moieties to be located on non-terminal positions.

Duplexes according to the present invention can be assayed for their specific binding activity to a transcription factor. As used herein, the term "binding affinity" refers to the ability of a duplex to bind to a transcription factor via hydrogen bonds, van der Waals interactions, hydrophobic interactions, or otherwise. For example, a duplex can bind to a "leucine zipper" transcription factor or a helix-loop-helix transcription factor via positively charged amino acids in one region of the transcription factor.

Transcription factors, as the term is used herein, are DNA- or RNA-binding proteins that regulate the expression of genes. HIV tat and c-rel are examples of transcription factors which regulate the expression of genes. Also encompassed by the term are DNA and RNA binding proteins which are not strictly considered transcription factors, but which are known to be involved in cell proliferation. These transcription factors include c-myc, fos, and jun. Methods of the present invention are particularly suitable for use with transcription factors as target molecules since transcription factors generally occur in very small cellular quantities.

The compounds of the present invention also may be useful to bind to other target molecules. Target molecules of the present invention can include any of a variety of biologically significant molecules. Such other target molecules can be nucleic acid strands such as significant regions of DNA or RNA. Target molecules also can be carbohydrates, glycoproteins or other proteins. In some preferred embodiments of the present invention, the target molecule is a protein such as an immunoglobulin, receptor, receptor binding ligand, antigen or enzyme and more specifically can be a phospholipase, tumor necrosis factor, endotoxin, interleukin, plasminogen activator, protein kinase, cell adhesion molecule, lipoxygenase, hydrolase or transacylase. In other embodiments of the invention the target molecules can be important regions of the human immunodeficiency virus, Candida, herpes viruses, papillomaviruses, cytomegalovirus, rhinoviruses, hepatitis viruses, or influenza viruses. In yet other embodiments of the present invention the target molecules can be regions of an oncogene. In still further embodiments, the target molecule is ras 47-mer stem loop RNA, the TAR element of human immunodeficiency virus (HIV) or the gag-pol stem loop of HIV. Still other targets can induce cellular activity. For example, a target can induce interferon.

In binding to transcription factors or other target molecules, the transcription factor or other target molecule need not be purified. It can be present, for example, in a whole cell, in a humoral fluid, in a crude cell lysate, in serum or in other humoral or cellular extract. Of course, purified transcription factor or a purified form of another target molecule is also useful in some aspects of the invention.

In still other embodiments of the present invention, synthetically prepared transcription factor or other target molecule can be useful. A transcription factor or other target molecule also can be modified, such as by biotinylation or radiolabeling. For example, synthetically prepared transcription factor can incorporate one or more biotin molecules during synthesis or can be modified post-synthesis.

An illustrative series of PNA oligomers according to the invention can be prepared as described in Example 1 below and have been designed as follows:

(1) Formulas 1 (SEQ ID NO:1) and 2 (SEQ ID NO:2), two complementary antiparallel PNA decamers that are not self-complementary:

(aminoterminal) H-Gly-GTAGATCACT-LysNH2                       1
               LysNH2-CATCTAGTGA-Gly-H (aminoterminal)        2

(2) Formula 3 (SEQ ID NO:3), a single PNA oligomer possessing a self-complementary motif ten base pairs long with an intervening loop region containing five base units:

 H-Gly-GTAGATCACT-TT
                    T        3
LysNH2-CATCTAGTGA-TT

(3) Formula 4 (SEQ ID NO:4), a single PNA oligomer possessing a self-complementary motif ten base pairs long linked by an oligomethylene (n = 1-10) spacer:

 H-Gly-GTAGATCACT-(CH2)n
                   |         4
LysNH2-CATCTAGTGA-CH2

(4) Formula 5 (SEQ ID NO:5), a single PNA oligomer possessing a self-complementary motif ten base pairs long on one side interrupted by a three base bulge and an intervening loop region containing five base units:

           TAT
 H-Gly-GTAGAT CACT-TT
                     T       5
LysNH2-CATCTA GTGA-TT

In each of the foregoing, LysNH2 is intended to indicate that a lysine-amide is attached to the carboxyl end of the PNA. Use of such lysine-amide is not necessary; however, its use is preferred since it is believed to suppress aggregation of the oligomers. The aminoterminal end of the PNA is substituted with a glycine residue to avoid migration of the N-terminal nucleobase. The PNA amino-terminal and carboxy-terminal ends are intended to correspond, respectively, to the 5'-ends and 3'-ends of DNA. As a consequence of the designed sequences, these PNAs form duplexes having DNA-like antiparallel orientations. These PNAs additionally are capable of adopting a tertiary structure.
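
The design rules for formulas 1 through 3 (antiparallel complementarity, absence of self-complementarity, and a hairpin with a ten-base stem and five-base loop) can be checked with a short sketch. Sequences are written amino terminus to carboxy terminus, treated as 5' to 3' per the correspondence stated above; the function names are illustrative only, and the linear string used for formula 3 is an assumed reading of the hairpin drawing.

```python
# Watson-Crick pairing among the four main DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Antiparallel complement, written in the same N-to-C (5'-to-3') sense."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def is_self_complementary(seq: str) -> bool:
    """True if a strand can pair with a second copy of itself."""
    return reverse_complement(seq) == seq


def hairpin_stem_pairs(seq: str, stem_len: int) -> bool:
    """True if the first and last stem_len bases form an antiparallel duplex."""
    return reverse_complement(seq[:stem_len]) == seq[-stem_len:]


FORMULA_1 = "GTAGATCACT"  # H-Gly-GTAGATCACT-LysNH2
FORMULA_2 = "AGTGATCTAC"  # formula 2 read from its amino terminus
# Assumed linear reading of formula 3: stem, five-T loop, complementary stem.
FORMULA_3 = FORMULA_1 + "TTTTT" + FORMULA_2

print(reverse_complement(FORMULA_1) == FORMULA_2)
print(is_self_complementary(FORMULA_1))
print(hairpin_stem_pairs(FORMULA_3, 10))
```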

As can be seen in Figure 1, the circular dichroism (CD) of the PNA 10-mers of Example 1 is almost vanishingly small, indicating that there is no preferred helical stacking of bases. However, a strong CD spectrum arises upon titration of one 10-mer with the complementary 10-mer, with saturation obtained at about 1:1 stoichiometry, as shown in Figure 2. The CD spectrum resembles that of B-DNA, indicating a right-handed helix. It is believed that a PNA-PNA complex having no preferred helicity initially is formed. The kinetics by which this double-stranded structure reorganizes into a uniform, right-handed double helix have been monitored and the activation parameters for the process determined.
For DNA, circular dichroism in the nucleobase absorption region arises both from helical stacking of the bases, by excitation interactions between the neighboring bases, and from interactions with transitions of the chiral ribose moieties. In contrast, in PNA the electronic interaction between most of the bases and the chiral terminal lysine is negligible and the main source of circular dichroism is attributable almost solely to the chiral orientation of the base-pairs relative to each other. The right-handed helicity observed for PNA is determined by the chiral bias of a terminal lysine residue. The formation of a helical duplex between the two complementary PNA oligomers was slow enough to be followed by the increase in circular dichroism with time. This is in contrast with DNA-DNA duplex formation, which occurs within seconds. The development of circular dichroism follows first order reaction kinetics. Activation parameters have been determined by following the association at various temperatures, as shown in Figure 3. Control experiments with different PNA concentrations gave identical results, confirming that the PNA-PNA association is fast and includes a reorganization process in an already base-paired complex. This observation is further supported by the absence of time dependent hypochromicity in normal absorption spectra. Such a reorganization in the corresponding cases of DNA-DNA and DNA-PNA decamer duplexes is too fast to be observed, indicating that the local chirality of the ribose in those cases immediately determines the handedness of base stacking. The conclusion is that the association of the two PNA oligomers is fast in the base-pairing first step, but is followed by a slow seeding of the duplex chirality, from the terminal lysine residue, towards its right-handed helical structure. The activation energy obtained is 33.9 kJ/mole, which is low because the transition state is associated with a large negative entropy change (ΔS‡ = -173 J/(mole·K); see, e.g., Figure 3), implying a highly ordered transition state. This is a strong indication that the rate-limiting inversion step is a cooperative seeding of chirality from the terminal base-pairs involving the entire stack of bases.
We believe this is the first time a pure cooperative inversion transition in a nucleic acid-like structure has been isolated. Also, in contrast to DNA wherein ribose residues act as local chiral centers, optical activity of PNA-PNA duplexes is entirely a result of the helical arrangement of the nucleobases relative to each other. The CD spectrum can be compared with the theoretical and experimental spectra that have been generated for helix stacks of DNA bases to demonstrate the mimicry of DNA duplex structure. These PNA-PNA duplexes therefore are useful mimics of DNA for the purpose of modulating the expression or transcription of DNA and thus modulating a disease state to the benefit of a living organism.

The utility of these PNA-containing duplex structures can be illustrated by constructing PNA sequences which correspond to various sequences of the HIV TAR element that have the potential to form duplex structures either as stem-loop structures or as two PNAs forming a duplex structure. In a competition assay, PNA structures that bind the tat transcription factor prevent binding of the competitor TAR sequence present in the incubation mixture. As the TAR RNA sequence is biotinylated, only tat proteins available to bind to TAR will remain on the microtiter plate after washing away unbound molecules and tat protein complexed to a PNA sequence. The concentration dependence of the competition between the TAR PNA structures and biotinylated TAR structure will serve to define those sequences capable of effectively competing for tat and thus useful as HIV modulatory agents.
Additional objects, advantages, and novel features of this invention will become apparent to those skilled in the art upon examination of the following examples thereof, which are not intended to be limiting.
EXAMPLE 1

Synthesis Of PNA Structures

PNA having formulas 1 through 5 above are prepared generally according to the synthetic protocols of our prior patent application WO 92/20702. Migration of the last nucleobase methylcarbonyl moiety to the terminal nitrogen is prevented by capping the N-terminus of the PNA chain with a glycine residue. The compounds are purified by HPLC (reverse phase, 0.1% trifluoroacetic acid in acetonitrile/water) and the composition verified by mass spectrometry.
EXAMPLE 2 

Binding And Helix Formation Of Complementary Antiparallel PNA Strands (Figure 1)

The circular dichroism spectra of PNA-PNA mixtures were obtained by titrating PNA having sequence H-GTAGATCACT-LysNH2 (PNA formula 1) with PNA having sequence H-AGTGATCTAC-LysNH2 (PNA formula 2). The concentration of PNA formula 1 was held constant (50 µmole/L) and the concentration of PNA formula 2 was increased to provide the following formula 2:formula 1 stoichiometries: 0.25 (Curve C), 0.50 (Curve D), 0.75 (Curve E), 1.00 (Curve F), and 1.25 (Curve G). The hybridizations were performed in a 5 mmol/L sodium phosphate buffer, pH 7.0, at 20°C, after 20 minutes of incubation. The path length was 1 cm. Saturation was obtained at equimolar amounts of the two decamers.

Figure 2 shows development of negative circular dichroism (at 220 nm) as a function of time after mixing equimolar amounts of PNA formula 1 with PNA formula 2. From top to bottom, the curves correspond to the following temperatures: 5°C, 15°C, 23°C, 32°C, 41°C, and 47°C.

Figure 3 shows an Arrhenius plot of rates from the CD kinetics. The plot provides the activation energy as ΔH‡ = 33.9 kJ/mole (with the approximation that (k_B T/h) exp(ΔS‡/R) is constant). The full rate equation, k = (k_B T/h) exp(-ΔH‡/RT) exp(ΔS‡/R), then gives ΔS‡ = -173 J/(mole·K).
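
The relation between the rate equation and the activation parameters can be sketched numerically. The example below is an illustration under stated assumptions, not the analysis actually performed: rate constants are taken to be in reciprocal seconds, the rates are synthesized from the reported parameters rather than measured, and the function names are hypothetical. It evaluates k = (k_B T/h) exp(-ΔH‡/RT) exp(ΔS‡/R) at the temperatures listed for Figure 2 and then recovers ΔH‡ and ΔS‡ from a straight-line fit of ln(k/T) against 1/T.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
H = 6.62607015e-34   # Planck constant, J*s
R = 8.314462618      # gas constant, J/(mol*K)


def eyring_rate(temp_k: float, dh: float, ds: float) -> float:
    """k = (k_B*T/h) * exp(-dH/(R*T)) * exp(dS/R); dh in J/mol, ds in J/(mol*K)."""
    return (K_B * temp_k / H) * math.exp(-dh / (R * temp_k)) * math.exp(ds / R)


def fit_eyring(temps_k, rates):
    """Least-squares line through ln(k/T) versus 1/T.

    slope = -dH/R and intercept = ln(k_B/h) + dS/R.
    """
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(k / t) for k, t in zip(rates, temps_k)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    dh = -slope * R
    ds = (intercept - math.log(K_B / H)) * R
    return dh, ds


DH_REPORTED = 33.9e3    # J/mol, from the text
DS_REPORTED = -173.0    # J/(mol*K), from the text
TEMPS_C = [5, 15, 23, 32, 41, 47]   # temperatures listed for Figure 2

temps_k = [c + 273.15 for c in TEMPS_C]
# Synthetic rates generated from the reported parameters (not measured data).
rates = [eyring_rate(t, DH_REPORTED, DS_REPORTED) for t in temps_k]
dh_fit, ds_fit = fit_eyring(temps_k, rates)
k_20c = eyring_rate(293.15, DH_REPORTED, DS_REPORTED)
print(dh_fit, ds_fit, k_20c, math.log(2) / k_20c)
```

With these inputs the predicted first-order half-life at 20°C is on the order of minutes, which is consistent with a reorganization slow enough to follow by CD.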
EXAMPLE 3

PNA Having Binding Affinity For The HIV-tat Protein As Measured in a Competitive Inhibition Assay

Samples of PNAs corresponding to various TAR sequences prepared by the method of Example 1 are incubated with recombinant tat transcription factor (100 µM) for 15 minutes at room temperature at 1, 3, 10, 30, and 100 µM (see, e.g., Cullen, et al., Cell 1990, 63, 655). A competitor, a truncated version of the TAR sequence corresponding to residues 16-45 as a 2'-O-methyl oligonucleotide, is employed as a TAR sequence and is biotinylated at the 3'-O end by procedures generally in accordance with the protocols of Application Serial No. 08/032,852, Combinatorial Oligomer Immunoabsorbant Screening Assay For Transcription Factors And Other Biomolecule Binding, filed March 16, 1993, the entire contents of which are incorporated herein by reference. This TAR sequence is added at 100 nM concentration. The reaction is incubated for 20 minutes and then added to streptavidin-coated microtiter plate wells. After unbound molecules are washed away with phosphate-buffered saline (PBS), 100 µL of 1:500 tat antisera is added to each well and incubated for 2 hours. Protein A/G antisera phosphatase is bound to the tat antibodies and PNPP (p-nitrophenylphosphate) substrate (200 µl) then is added. Color development is measured 2 hours later by reading absorbance at 405 nm on a Titertek Multiscan ELISA plate reader.
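
One way the concentration dependence from such an assay might be reduced to a single figure is sketched below. The absorbance values are hypothetical placeholders, not results; the function names and the choice of log-linear interpolation for the 50% point are assumptions made for illustration. Lower absorbance at 405 nm is taken to mean that more tat was sequestered by the PNA and therefore not captured on the plate.

```python
import math


def percent_inhibition(a405: float, a405_no_pna: float, a405_blank: float = 0.0) -> float:
    """Signal lost relative to a no-PNA control well, as a percentage."""
    return 100.0 * (1.0 - (a405 - a405_blank) / (a405_no_pna - a405_blank))


def ic50_log_interp(concs_um, inhibitions):
    """Interpolate linearly in log10(concentration) between the two points
    bracketing 50% inhibition; returns None if 50% is never crossed."""
    points = list(zip(concs_um, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi:
            if i_hi == i_lo:
                return float(c_lo)
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None


CONCS_UM = [1, 3, 10, 30, 100]             # PNA concentrations from the example
A405_NO_PNA = 1.20                         # hypothetical control reading
A405 = [1.14, 0.96, 0.66, 0.30, 0.12]      # hypothetical readings, one per concentration

inhibitions = [percent_inhibition(a, A405_NO_PNA) for a in A405]
ic50 = ic50_log_interp(CONCS_UM, inhibitions)
print(inhibitions, ic50)
```

A curve fit to a binding model would be more rigorous; interpolation is used here only because five concentrations are specified.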
EXAMPLE 4 

PNA Having Binding Affinity For The C-sayc Protein 

Myc-c is a nuclear protein involved in cell prolifera- 
tion, differentiation, and neoplastic disease and binds DNA in 

30 a sequence specific manner. See, e.g., Nissen, Cancer Research 
1986, 46, 6217 and Blackwell, Science 1990, 250, 1149. Crude 
nuclear extracts of myc-c are prepared generally in accordance 
with Franza, et al., Mature 1987, 330, 391, from HL 60 cells 
stimulated to induce the expression of myc-c. 

Phosphorothioate oligonucleotides having the sequences
GAT CCC CCC ACC ACG TGG TGC CTG A-B (SEQ ID NO: 6) and GAT CTC
AGG CAC CAC GTG GTG GGG G-B (SEQ ID NO: 7), where B=biotin, are
synthesized on an automated DNA synthesizer (Applied Biosystems
model 380B) using modified standard phosphoramidite chemistry
with oxidation by a 0.2M solution of 3H-1,2-benzodithiole-3-one
1,1-dioxide in acetonitrile for stepwise thiation of phosphite
linkages. The thiation cycle wait step is 68 seconds and is
followed by the capping step. β-Cyanoethyldiisopropyl
phosphoramidites can be purchased from Applied Biosystems
(Foster City, CA). Bases are deprotected by incubation in
methanolic ammonia overnight. Following base deprotection, the
oligonucleotides are dried in vacuo. Removal of 2'-hydroxyl
t-butyldimethylsilyl protecting groups is effected by incubating
the oligonucleotide in 1M tetrabutylammonium fluoride in
tetrahydrofuran overnight. The RNA oligonucleotides are
further purified on C18 Sep-Pak cartridges (Waters, Division of
Millipore Corp., Milford, MA) and ethanol precipitated. The
phosphorothioate oligonucleotides are hybridized to create the
double stranded NF-kB binding site.
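
The hybridization step can be checked on paper before synthesis. The following is a minimal sketch, not part of the described procedure, that tests whether SEQ ID NO: 6 and SEQ ID NO: 7 are complementary once a short unpaired stretch at each 5' end is set aside. The function names and the four-base overhang are assumptions made for this illustration; the text itself does not describe the geometry of the annealed duplex.

```python
# Sequences as printed in the text (5'->3'), without the 3' biotin label "B".
SEQ_ID_6 = "GAT CCC CCC ACC ACG TGG TGC CTG A".replace(" ", "")
SEQ_ID_7 = "GAT CTC AGG CAC CAC GTG GTG GGG G".replace(" ", "")

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def paired_core(strand_a, strand_b, overhang):
    """Return the base-paired region of strand_a.

    Assumes each strand carries `overhang` unpaired bases at its 5' end
    (an assumption of this sketch, not a statement from the text).
    Raises ValueError if the remaining bases are not fully complementary.
    """
    core_a, core_b = strand_a[overhang:], strand_b[overhang:]
    if reverse_complement(core_a) != core_b:
        raise ValueError("strands are not complementary with this overhang")
    return core_a


core = paired_core(SEQ_ID_6, SEQ_ID_7, overhang=4)
print(len(core), core)
```

Under that assumption the two strands pair over 21 bases and each keeps a GATC overhang at its 5' end. The paired region contains the palindromic hexamer CACGTG, the motif commonly described as the E-box recognized by myc.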

A series of PNA-PNA duplexes is synthesized and
hybridized to give a new series of PNA duplexes corresponding
to different length portions of the myc-c binding sequence.
Each duplex is incubated in triplicate at concentrations of 1,
3, 10, 30, and 100 µM with the HL-60 extract described above.
The myc P=S binding site then is added and the mixtures are
incubated and washed with PBS. An antibody directed to the
leucine zipper region of the myc protein (Santa Cruz
Biotechnology) is added at a 1:1000 dilution. Non-bound
molecules are washed away with PBS. Binding of myc to
biotinylated c-myc transcription factor is quantitated by
adding 100 µl of 1:500 tat antisera to each well for 2 hours.
Protein A/G-alkaline phosphatase (Pierce; 1:5000; 100 µl) then
is added and any excess is removed by washing with PBS. PNPP
substrate (200 µl) then is added. Color development is
measured 2 hours later by reading absorbance at 405 nm on a
Titertek Multiscan ELISA plate reader.
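
The absorbance readings from such a competition assay can be reduced to a percent-inhibition value for each duplex concentration. The sketch below shows one common way to do that and is offered only as an illustration. The no-competitor control wells and the blank wells are assumptions of this sketch, since the text does not describe controls, and the function names are hypothetical.

```python
from statistics import mean


def percent_inhibition(sample_a405, control_a405, blank_a405):
    """Percent loss of signal relative to a no-competitor control.

    sample_a405  -- replicate absorbance readings for one duplex concentration
    control_a405 -- replicate readings with no PNA duplex added (assumed control)
    blank_a405   -- replicate readings from wells with no extract (assumed blank)
    """
    blank = mean(blank_a405)
    full_signal = mean(control_a405) - blank
    if full_signal <= 0:
        raise ValueError("control signal must exceed the blank")
    return 100.0 * (1.0 - (mean(sample_a405) - blank) / full_signal)


def inhibition_curve(readings_by_conc, control_a405, blank_a405):
    """Map each duplex concentration (µM) to its percent inhibition."""
    return {
        conc: percent_inhibition(reps, control_a405, blank_a405)
        for conc, reps in sorted(readings_by_conc.items())
    }


# Placeholder numbers for illustration only; they are not measured values.
example = inhibition_curve(
    {1: [1.0, 1.0, 1.0], 100: [0.5, 0.5, 0.5]},
    control_a405=[1.0, 1.0, 1.0],
    blank_a405=[0.0, 0.0, 0.0],
)
print(example)  # {1: 0.0, 100: 50.0}
```

A duplex that competes effectively for the protein would be expected to show inhibition rising across the 1 to 100 µM series. Averaging the triplicate wells before subtraction keeps a single outlying well from dominating the result.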

EXAMPLE 5

PNA Having Binding Affinity For The C-rel Transcription Factor 
C-rel has been shown to represent a constituent of the
NF-kB site binding transcription factor, which plays a crucial
role in the expression of a number of genes including the
immunoglobulin κ light chain gene, IL-2ra, and MHC. (See, e.g.,
Gilmore, et al., Cell 1986, 62, 791.)
Crude nuclear extracts are prepared as detailed by
Franza, et al., Nature 1987, 330, 391, from Jurkat cells
stimulated 4 hours with 1 µM PHA and 100 nM PMA to induce the
expression of rel. The extract is then preabsorbed with 100 µl
streptavidin agarose per ml for 10 minutes. This is followed
with the addition of poly dI·dC as a nonspecific competitor at
a concentration of 100 µg/ml of extract. Nuclear extracts
containing the biotinylated NF-kB binding site competitor are
prepared as in Example 4, above.

A series of PNA duplexes is synthesized to correspond to
various length fragments of the consensus binding sequence of
c-rel. NF-kB binding site competitor is added to each duplex
and the resulting samples are washed. Antibody directed to rel
is added. The amount of rel bound is quantitated by adding 100
µl of 1:500 rel antisera to each well for 2 hours. Protein
A/G-alkaline phosphatase (Pierce; 1:5000; 100 µl) then is added
and any excess is removed by washing with PBS. PNPP substrate
(200 µl) then is added. Color development is measured 2 hours
later by reading absorbance at 405 nm on a Titertek Multiscan
ELISA plate reader.

EXAMPLE 6

PNA Having Binding Affinity For The AP-1 Transcription Factor
Genes belonging to the fos and jun oncogene families
encode nuclear proteins associated with a number of
transcriptional complexes, see, e.g., Konig, et al., EMBO Journal
1989, 8, 2559. C-jun is a major component of the AP-1 binding site,
which was originally shown to regulate tissue plasminogen
activator (TPA) induced expression of responsive genes through
the TPA response element (TRE). The jun protein forms homo- or
heterodimers which bind the TRE. The fos protein is only
active as a heterodimer with any of the jun family of proteins.
Fos/jun heterodimers have a much higher affinity for the TRE
than jun homodimers.

Both the fos and the jun cDNA have been cloned downstream
of the Sp6 promoter. RNA is produced from each plasmid in
vitro, then used to produce functional jun and fos proteins in
rabbit reticulocyte lysates. The fos and jun proteins are
then allowed to bind to the biotinylated AP-1 binding site in
competition with PNA duplex sequences constructed as mimics of
the proper consensus sequence for binding fos and jun, CGC TTG
GTG ACT CAG CCG GAA. Binding is quantitated with an antibody
directed to fos or jun. When the fos alone is incubated with
the AP-1 site there will be no detectable binding with either
antibody. When the jun alone is incubated with the binding
site, a signal will be detected with only the jun antibody.
This is consistent with the formation of a jun homodimer, which
has previously been demonstrated to bind AP-1. When the fos
and jun proteins are mixed a signal will be detected with both
fos and jun antibodies. This is consistent with the formation
of a fos/jun heterodimer which is known to bind the AP-1 site and
should be detectable with either antibody.

PNA sequences of the present invention can be tested for
the ability to block the formation of the fos/jun heterodimer.
Molecules which block formation will decrease the signal
detected with the fos antibody, but not the jun antibody.
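
Examples 4 and 5 describe PNA duplexes that correspond to different-length portions of a binding sequence. A minimal sketch of how candidate fragments might be enumerated for the consensus sequence given in this example follows. Applying the different-length approach here, the function name, and the choice of TGACTCA as the core TRE motif are all assumptions of this sketch; the text does not state which bases must be retained.

```python
# Consensus sequence as printed in the text, 5'->3'.
CONSENSUS = "CGC TTG GTG ACT CAG CCG GAA".replace(" ", "")

# Assumption: the TRE core heptamer; the text does not spell it out.
TRE_CORE = "TGACTCA"


def fragments_containing(seq, core):
    """Return every contiguous fragment of seq that spans the first
    occurrence of core, ordered from shortest to longest."""
    start = seq.find(core)
    if start < 0:
        raise ValueError("core motif not found in sequence")
    end = start + len(core)
    frags = [
        seq[i:j]
        for i in range(start + 1)          # fragment may begin at or before the core
        for j in range(end, len(seq) + 1)  # and must end at or after it
    ]
    return sorted(frags, key=len)


frags = fragments_containing(CONSENSUS, TRE_CORE)
print(len(frags), frags[0], frags[-1])
```

For the 21-base sequence above this yields 64 candidates, from the bare heptamer up to the full consensus sequence. In practice only a handful of lengths would be synthesized, and each chosen fragment would be made as a pair of complementary PNA strands.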
EXAMPLE 7 

Chimera Macromolecule Having Peptide Nucleic Acids Section
Attaching to 3' Terminus of a 2'-Deoxy Phosphorothioate
Oligonucleotide Section

A first section of peptide nucleic acids is prepared as
per PCT patent application WO 92/20702. The peptide nucleic
acids are prepared from the C terminus towards the N terminus
using monomers having protected amino groups. Following
completion of the peptide region, the terminal amine blocking
group is removed and the resulting amine reacted with a 3'-C-
(formyl)-2',3'-dideoxy-5'-trityl nucleotide as prepared per the
procedure of Vasseur, et al., J. Am. Chem. Soc. 1992, 114,
4006. The condensation of the amine with the aldehyde moiety
of the C-formyl nucleoside is effected as per the conditions of
Vasseur, ibid., to yield an intermediate imine linkage.
The imine linkage is reduced under the reductive alkylation
conditions of Vasseur, ibid., with HCHO/NaBH3CN/AcOH to yield
the nucleoside connected to the peptide nucleic acid via a
methyl alkylated amine linkage. An internal 2'-deoxy
phosphorothioate nucleotide region is then continued from this
nucleoside as per standard automated DNA synthetic protocols
(see Oligonucleotide Synthesis, A Practical Approach, M.J. Gait
ed., IRL Press, 1984).
EXAMPLE 8 

Chimera Macromolecule Having Peptide Nucleic Acids Section
Attaching to 5' Terminus of a Phosphorothioate Oligonucleotide
Section

A phosphorothioate oligonucleotide is prepared in the
standard manner on a solid support as per standard protocols
(see Oligonucleotides and Analogues, A Practical Approach, F.
Eckstein Ed., IRL Press, 1991). The dimethoxytrityl blocking
group on that nucleotide is removed in the standard manner.
Peptide synthesis for the peptide region is commenced by
reaction of the carboxyl end of the first peptide nucleic acid
of this region with the 5' hydroxy of the last nucleotide of
the DNA region. Coupling is effected via EDC (Pierce) in
pyridine to form an ester linkage between the peptide and the
nucleoside. Peptide synthesis is then continued in the manner
of patent application WO 92/20702 to complete the peptide
nucleic acid region.
EXAMPLE 9 

Double Stranded Structures That Include Chimera Strand 

Duplex structures will be formed with the chimera strands
of Examples 7 and 8. Duplex structures can include duplexes
between a PNA-RNA or PNA-DNA strand and an RNA strand, a PNA-RNA
or PNA-DNA strand and a DNA strand, a PNA-RNA or PNA-DNA strand
and a PNA strand, or a PNA-RNA or PNA-DNA strand and a further
chimeric PNA-DNA or PNA-RNA strand.
EXAMPLE 10 

Binding between PNA containing double stranded structure and
transcription factor or other protein

A double stranded PNA structure, a structure containing
a PNA chimeric strand and a nucleic acid strand, or two PNA
chimera strands will be used to bind to or otherwise modulate
single stranded DNA, double stranded DNA, RNA, a transcription
factor or other protein. In the use of a PNA containing
chimera, part of the binding between the chimera and the
transcription factor or other protein can include binding
between the sugar-phosphate backbone of the DNA or RNA portion
of the chimera and hydrogen bonding between the ligands, e.g.
nucleobases, of the PNA portion of the chimera. Binding to the
sugar-phosphate backbone includes binding to phosphodiester
linkages, phosphorothioate linkages or other linkages that may
be used as the backbone of the DNA or RNA. In other instances,
bonding can include hydrophobic contacts between hydrophobic
groups on the ligands, including nucleobases, of the PNA or the
nucleobases of the nucleic acid portion of the chimera with
like hydrophobic groups on proteins that are being bound. Such
hydrophobic groups on the chimeric strand include the methyl
groups on thymine nucleobases.

Those skilled in the art will appreciate that numerous
changes and modifications can be made to the preferred
embodiments of the invention and that such changes and
modifications can be made without departing from the spirit of
the invention. It is therefore intended that the appended
claims cover all such equivalent variations as fall within the
true spirit and scope of the invention.

WHAT IS CLAIMED IS:

1. A compound comprising a first polymeric strand and
a second polymeric strand, wherein:

each of said strands includes a sequence of ligands
covalently bound by linking moieties, at least one of said
linking moieties comprising an amide, thioamide, sulfinamide
or sulfonamide linkage; and

a plurality of ligands on said first strand hydrogen
bond with ligands on said second strand.

2. The compound of claim 1 wherein said first and
second strands are covalently bound.

3. The compound of claim 1 wherein said first and 
second strands are not covalently bound. 

4. The compound of claim 1 wherein said first strand
and said second strand hydrogen bond to form a double strand.

5. The compound of claim 1 wherein said double strand
is a right-handed double strand.

6. The compound of claim 1 wherein at least a portion
of said ligands are selected from naturally occurring
nucleobases and non-naturally occurring nucleobases.

7. The compound of claim 6 wherein said naturally
occurring nucleobases include purine nucleobases and
pyrimidine nucleobases.

8. The compound of claim 7 wherein said purine and
pyrimidine nucleobases include adenine, guanine, thymine,
uridine and cytosine.

9. The compound of claim 1 wherein at least two of
said linking moieties each have amino ends and carboxyl ends
and said linking moieties are covalently bound via amide
linkages.

10. The compound of claim 9 wherein each of said
linking moieties includes a nitrogen atom between said amino
end and said carboxyl end.

11. The compound of claim 10 wherein each of said
ligands is connected to said linking moieties via said
nitrogen atoms.

12. The compound of claim 1 wherein at least one of
said strands has the formula:

[Formula not reproduced]

wherein:

n is at least 2,

each of L¹-Lⁿ is independently selected from the group
consisting of hydrogen, hydroxy, (C₁-C₄)alkanoyl, naturally
occurring nucleobases, non-naturally occurring nucleobases,
aromatic moieties, DNA intercalators, nucleobase-binding
groups, heterocyclic moieties, and reporter ligands;

each of C¹-Cⁿ is (CR⁶R⁷)y where R⁶ is hydrogen and R⁷ is
selected from the group consisting of the side chains of
naturally occurring alpha amino acids, or R⁶ and R⁷ are
independently selected from the group consisting of hydrogen,
(C₂-C₆)alkyl, aryl, aralkyl, heteroaryl, hydroxy,
(C₁-C₆)alkoxy, (C₁-C₆)alkylthio, NR³R⁴ and SR⁵, where R³ and R⁴ are
as defined above, and R⁵ is hydrogen, (C₁-C₆)alkyl, hydroxy-,
alkoxy-, or alkylthio-substituted (C₁-C₆)alkyl, or R⁶ and R⁷
taken together complete an alicyclic or heterocyclic system;

each of D¹-Dⁿ is (CR⁶R⁷)z where R⁶ and R⁷ are as defined
above;

each of y and z is zero or an integer from 1 to 10,
the sum y + z being greater than 2 but not more than 10;

each of G¹-Gⁿ⁻¹ is -NR³CO-, -NR³CS-, -NR³SO- or
-NR³SO₂-, in either orientation, where R³ is as defined above;

each of A¹-Aⁿ and B¹-Bⁿ are selected such that:

(a) A is a group of formula (IIa), (IIb) or (IIc),
and B is N or R³N⁺; or



WO 95/01369 



PCT/IB94/00211 




(b) A is a group of formula (IId) and B is CH;

[Structural formulas (IIa), (IIb), (IIc) and (IId) not reproduced]

where:

X is O, S, Se, NR³, CH₂ or C(CH₃)₂;

Y is a single bond, O, S or NR⁴;

each of p and q is zero or an integer from 1 to
5, the sum p+q being not more than 10;

each of r and s is zero or an integer from 1 to
5, the sum r+s being not more than 10;

each R¹ and R² is independently selected from the
group consisting of hydrogen, (C₁-C₄)alkyl which may be
hydroxy- or alkoxy- or alkylthio-substituted, hydroxy,
alkoxy, alkylthio, amino and halogen; and

each R³ and R⁴ is independently selected from the
group consisting of hydrogen, (C₁-C₄)alkyl, hydroxy- or
alkoxy- or alkylthio-substituted (C₁-C₄)alkyl, hydroxy,
alkoxy, alkylthio and amino;

Q is -CO₂H, -CONR'R'', -SO₃H or -SO₂NR'R'' or an
activated derivative of -CO₂H or -SO₃H; and

I is -NHR'''R'''' or -NR'''C(O)R'''', where R', R'',
R''' and R'''' are independently selected from the group
consisting of hydrogen, alkyl, amino protecting groups,
reporter ligands, intercalators, chelators, peptides,
proteins, carbohydrates, lipids, steroids, nucleosides,
nucleotides, nucleotide diphosphates, nucleotide
triphosphates, oligonucleotides, oligonucleosides and soluble
and non-soluble polymers.
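
The numeric limits recited in claim 12 for n, y, z, p, q, r and s can be restated as a simple check. The sketch below is an informal reading aid only and does not interpret or limit the claim; the function name and the example values are hypothetical.

```python
def index_violations(n, y, z, p=0, q=0, r=0, s=0):
    """Return a list of the recited numeric limits that the given
    integer values fail to meet; an empty list means all are met."""
    problems = []
    if n < 2:
        problems.append("n must be at least 2")
    for name, value in (("y", y), ("z", z)):
        if not 0 <= value <= 10:
            problems.append(f"{name} must be zero or an integer from 1 to 10")
    if not 2 < y + z <= 10:
        problems.append("y + z must be greater than 2 but not more than 10")
    for name, value in (("p", p), ("q", q), ("r", r), ("s", s)):
        if not 0 <= value <= 5:
            problems.append(f"{name} must be zero or an integer from 1 to 5")
    if p + q > 10:
        problems.append("p + q must not be more than 10")
    if r + s > 10:
        problems.append("r + s must not be more than 10")
    return problems


# Illustrative values only; they are not tied to any particular compound.
print(index_violations(n=2, y=2, z=1))  # []
print(index_violations(n=1, y=1, z=1))
```

The second call reports two problems: n is below 2, and y + z equals 2, which is not greater than 2.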

13. The compound of claim 1 wherein each of said first
and second polymeric strands comprises a moiety of the
formula:

[Formula not reproduced]

or the formula

[Formula not reproduced]

or the formula

[Formula not reproduced]

wherein:

each L is independently selected from the group
consisting of hydrogen, phenyl, heterocyclic moieties,
naturally occurring nucleobases, and non-naturally occurring
nucleobases;

each R⁷' is independently selected from the group
consisting of hydrogen and the side chains of naturally
occurring alpha amino acids;

n is an integer greater than 1,

each k, l, and m is, independently, zero or an integer
from 1 to 5;

each p is zero or 1;

Rʰ is OH, NH₂ or -NHLysNH₂; and

Rⁱ is H or COCH₃.
14. A process for preparing a double-stranded
structure, comprising the steps of:

providing a first strand and a second strand, each
strand including a sequence of ligands covalently bound by
linking moieties, said linking moieties comprising at least
one amide, thioamide, sulfinamide or sulfonamide linkage; and

disposing said strands in space relative to one
another to form hydrogen bonds therebetween.

15. The process of claim 14 wherein said hydrogen
bonds are formed between said ligands on said first strand and
said ligands on said second strand.

16. The process of claim 14 wherein at least a portion
of said ligands are selected from naturally occurring
nucleobases and non-naturally occurring nucleobases.

17. The process of claim 16 wherein said naturally
occurring nucleobases are selected to be complementary to
nucleobases in a predetermined DNA double strand.

18. The process of claim 14 further comprising
annealing said first strand and said second strand.

19. The process of claim 14 wherein said first strand
and said second strand are covalently bound.

20. The process of claim 14 wherein said first strand
and said second strand are not covalently bound.

21. The process of claim 14 wherein at least two of
said linking moieties in a strand have both amino ends and
carboxyl ends and said linking moieties are covalently bound
via amide linkages.

22. The process of claim 21 wherein each of said
linking moieties includes a nitrogen atom between said amino
end and said carboxyl end.

23. The process of claim 22 wherein each of said
ligands is connected to said linking moieties via said
nitrogen atoms.

24. The process of claim 14 wherein each of said first and
second polymeric strands comprises a moiety of the formula:

[Formula not reproduced]

wherein:

n is at least 2,

each of L¹-Lⁿ is independently selected from the group
consisting of hydrogen, hydroxy, (C₁-C₄)alkanoyl, naturally
occurring nucleobases, non-naturally occurring nucleobases,
aromatic moieties, DNA intercalators, nucleobase-binding
groups, heterocyclic moieties, and reporter ligands;

each of C¹-Cⁿ is (CR⁶R⁷)y where R⁶ is hydrogen and R⁷ is
selected from the group consisting of the side chains of
naturally occurring alpha amino acids, or R⁶ and R⁷ are
independently selected from the group consisting of hydrogen,
(C₂-C₆)alkyl, aryl, aralkyl, heteroaryl, hydroxy,
(C₁-C₆)alkoxy, (C₁-C₆)alkylthio, NR³R⁴ and SR⁵, where R³ and R⁴ are
as defined above, and R⁵ is hydrogen, (C₁-C₆)alkyl, hydroxy-,
alkoxy-, or alkylthio-substituted (C₁-C₆)alkyl, or R⁶ and R⁷
taken together complete an alicyclic or heterocyclic system;

each of D¹-Dⁿ is (CR⁶R⁷)z where R⁶ and R⁷ are as defined
above;




each of y and z is zero or an integer from 1 to 10,
the sum y + z being greater than 2 but not more than 10;

each of G¹-Gⁿ⁻¹ is -NR³CO-, -NR³CS-, -NR³SO- or
-NR³SO₂-, in either orientation, where R³ is as defined above;

each of A¹-Aⁿ and B¹-Bⁿ are selected such that:

(a) A is a group of formula (IIa), (IIb) or (IIc),
and B is N or R³N⁺; or

(b) A is a group of formula (IId) and B is CH;

[Structural formulas (IIa), (IIb), (IIc) and (IId) not reproduced]

where:

X is O, S, Se, NR³, CH₂ or C(CH₃)₂;

Y is a single bond, O, S or NR⁴;

each of p and q is zero or an integer from 1 to
5, the sum p+q being not more than 10;

each of r and s is zero or an integer from 1 to
5, the sum r+s being not more than 10;

each R¹ and R² is independently selected from the
group consisting of hydrogen, (C₁-C₄)alkyl which may be
hydroxy- or alkoxy- or alkylthio-substituted, hydroxy,
alkoxy, alkylthio, amino and halogen; and

each R³ and R⁴ is independently selected from the
group consisting of hydrogen, (C₁-C₄)alkyl, hydroxy- or
alkoxy- or alkylthio-substituted (C₁-C₄)alkyl, hydroxy,
alkoxy, alkylthio and amino;

Q is -CO₂H, -CONR'R'', -SO₃H or -SO₂NR'R'' or an
activated derivative of -CO₂H or -SO₃H; and

I is -NHR'''R'''' or -NR'''C(O)R'''', where R', R'',
R''' and R'''' are independently selected from the group
consisting of hydrogen, alkyl, amino protecting groups,
reporter ligands, intercalators, chelators, peptides,
proteins, carbohydrates, lipids, steroids, nucleosides,
nucleotides, nucleotide diphosphates, nucleotide
triphosphates, oligonucleotides, oligonucleosides and soluble
and non-soluble polymers.

25. The process of claim 14 wherein each of said first
and second polymeric strands comprises a moiety of the
formula:

[Formula not reproduced]

or the formula

[Formula not reproduced]

wherein:

each L is independently selected from the group
consisting of hydrogen, phenyl, heterocyclic moieties,
naturally occurring nucleobases, and non-naturally occurring
nucleobases;

each R⁷' is independently selected from the group
consisting of hydrogen and the side chains of naturally
occurring alpha amino acids;

n is an integer greater than 1,

each k, l, and m is, independently, zero or an integer
from 1 to 5;

each p is zero or 1;

Rʰ is OH, NH₂ or -NHLysNH₂; and

Rⁱ is H or COCH₃.

26. A process for modulating the activity of a
transcription factor in a cell, comprising the steps of:

forming a double-stranded structure by:

providing a first strand and a second
strand, each strand including a sequence of
ligands covalently bound by linking moieties,
wherein said linking moieties comprise at least
one amide, thioamide, sulfinamide or sulfonamide
linkage, at least a portion of said ligands bind
said transcription factor; and

disposing said strands in space relative to
one another to form hydrogen bonds therebetween
and thereby form a double strand; and

introducing said double-stranded structure into said
cell.

27. The process of claim 26 wherein at least two of
said linking moieties in a strand have both amino ends and
carboxyl ends and said linking moieties are covalently bound
via amide linkages.

28. The process of claim 27 wherein each of said
linking moieties includes a nitrogen atom between said amino
end and said carboxyl end.

29. The process of claim 28 wherein each of said
ligands is connected to said linking moieties via said
nitrogen atoms.

30. A process for modulating the activity of a protein
in a cell, comprising the steps of:

forming a double-stranded structure by:

providing a first strand and a second
strand, each strand including a sequence of
ligands covalently bound by linking moieties,
wherein said linking moieties comprise at least
one amide, thioamide, sulfinamide or sulfonamide
linkage, at least a portion of said ligands bind
said protein; and

disposing said strands in space relative to
one another to form hydrogen bonds therebetween
and thereby form a double strand; and

introducing said double-stranded structure into said
cell.

31. The process of claim 30 wherein at least two of
said linking moieties in a strand have both amino ends and
carboxyl ends and said linking moieties are covalently bound
via amide linkages.

32. The process of claim 30 wherein each of said units
includes a nitrogen atom between said amino end and said
carboxyl end.

33. The process of claim 30 wherein each of said
ligands is connected to said linking moieties via said
nitrogen atoms.

34. The process of claim 30 wherein said protein is
a DNA binding protein.